AIbase

# Reinforcement Learning Reasoning

Acereason Nemotron 14B GGUF
Other
A math and programming reasoning model trained with reinforcement learning that performs strongly on multiple benchmarks.
Large Language Model, Transformers, English
unsloth
1,417
4
Open Reasoner Zero 7B
MIT
Open Reasoner Zero is an open-source approach to large-scale, reasoning-oriented reinforcement learning built on foundation models, with a focus on scalability, simplicity, and ease of use.
Large Language Model, Transformers
Open-Reasoner-Zero
776
28
Deepseek R1 Zero
MIT
DeepSeek-R1 is the first-generation reasoning model developed by DeepSeek, trained through reinforcement learning, excelling in mathematics, coding, and reasoning tasks.
Large Language Model, Transformers
deepseek-ai
4,034
905
© 2025 AIbase